Visualizing tool for evaluating inter-label similarity in prosodic labeling experiments
نویسندگان
چکیده
This paper presents a technique that allows us to detect similarities among prosodic labels used to describe pitch accents within the ToBI framework. The inter-label proximity is determined empirically as a result of the evidence obtained in contingency tables of inter-transcriber agreement tests and in the confusion matrices used in automatic prosodic labeling experiments. This tool may be useful to decide which labels can be grouped together when a simplified representation is required.
منابع مشابه
Analysis of inter-transcriber consistency in the Cat_ToBI prosodic labeling system
A set of tools to analyze inconsistencies observed in a Cat_ToBI labeling experiment are presented. We formalize and use the metrics that are commonly used in inconsistency tests. The metrics are systematically applied to analyze the robustness of every symbol and every pair of transcribers. The results reveal agreement rates for this study that are comparable to previous ToBI inter-reliability...
متن کاملOn the use of a fuzzy classifier to speed up the Sp_ToBI labeling of the Glissando Spanish corpus
In this paper, we present the application of a novel automatic prosodic labeling methodology for speeding up the manual labeling of the Glissando corpus (Spanish read news items). The methodology is based on the use of soft classification techniques. The output of the automatic system consists on a set of label candidates per word. The number of predicted candidates depends on the degree of cer...
متن کاملUnsupervised prosody labeling for constructing Mandarin TTS
This paper introduces an unsupervised prosody labeling method for preparing a large speech corpus used in developing a Mandarin Text-to-Speech system. Adopting a four-layer prosody hierarchy, the proposed method performs an unsupervised segmental clustering that iteratively segments spoken utterances into strings of prosodic constituents and models the patterns of the segmented prosodic constit...
متن کاملAn Effective Approach for Robust Metric Learning in the Presence of Label Noise
Many algorithms in machine learning, pattern recognition, and data mining are based on a similarity/distance measure. For example, the kNN classifier and clustering algorithms such as k-means require a similarity/distance function. Also, in Content-Based Information Retrieval (CBIR) systems, we need to rank the retrieved objects based on the similarity to the query. As generic measures such as ...
متن کاملA Prosodic Labeling System for Mandarin Speech Database
A working database needs tools to transcribe and label at both phonetic and prosodic levels. While the proposed phonetic transcription system is a simplified from of the International Phonetic Alphabet (IPA) following the SAMPA guidelines; the prosodic labeling system is an elaborated form of the ToBI (Tone and Break Indices) framework adopted for Mandarin. In particular, the proposed prosodic ...
متن کامل